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ABSTRACT 

Shackman,  Kenneth  Alan.   M.S.C.E.,  Purdue  University, 
January,  1972.   The  Prediction  of  Running  Speeds  on  Urban 
Arterial  Streets.   Major  Professor:   Harold  L.  Michael. 

The  purpose  of  this  research  project  was  the  develop- 
ment of  a  model  for  predicting  running  speeds  on  urban 
arterial  streets.   To  supplement  the  model  the  major 
causes  of  stop  delay  and  their  average  values  were  also 
evaluated.   Using  these  results  one  can  estimate  the  change 
in  average  running  speed  and  stop  delay  time  between  an 
existing  facility  and  a  proposed  or  improved  facility. 
The  estimate  of  the  benefits  gained,  in  terms  of  time,  speed, 
or  money,  could  then  be  contrasted  with  the  proposed  cost. 

The  data  collection  was  conducted  in  nine  Indiana 
cities  having  populations  between  thirty  and  one  hundred 
and  fifty  thousand;  the  average  car  method  was  the 
procedure  employed  for  collecting  the  speed-delay  data. 
Pneumatic  tube  traffic  counters  were  installed  to  determine 
the  volumes  using  the  streets  being  studied. 

The  prediction  model  was  formulated  using  stepwise 
regression.   It  utilizes  eleven  roadway  and  traffic 

condition  variables  combined  to  form  ten  terms;  it  has  an 

2 
R  of  0.5926  and  a  standard  error  of  estimate  of  3.417. 


IX 


The  major  influences  of  speed  are: 

1.  The  length  of  the  section 

2.  The  width  of  the  approach 

3.  The  number  of  signalized  intersections 

4.  The  number  of  unsignalized  intersections 

5.  The  number  of  alleys 

6.  The  population  of  the  city 

7.  The  number  of  railroad  crossings 

8.  Whether  or  not  parking  was  allowed  on  the 
driver's  right 

9 .  The  volume 

10.  The  lowest  G/C  ratio  of  the  traffic  signals 
included  in  the  section 

11.  Whether  or  not  good  progression  of  the  traffic 
signals  existed 

The  average  values  of  stop  delay  were  calculated  from 

data  collected  in  the  speed-delay  studies  conducted  in 

nine  cities.   Traffic  signals  were  found  to  be  the  major 

source  of  stop  delays,  contributing  over  eighty  percent  of 

the  total. 


INTRODUCTION 

In  a  recent  publication  (5)  it  was  stated  "that  at 
least  as  many  drivers  selected  the  least  'effort'  routes 
as  the  least  time  routes  to  a  shopping  center."   Just  what 
makes  least  "effort"  routes  so  inviting?   Basically,  three 
elements  of  high  quality  of  service  are  considered: 
1)  safety,  2)  constant  (or  gradually  changing)  direction, 
and  3)  a  constant,  reasonable  speed.   This  particular  study 
was  concerned  with  the  latter  element,  speed. 

Though  a  constant  speed  is  desirable  the  probability 
of  it  occurring  on  an  urban  arterial  street  is  low,  the 
presence  of  traffic  control  devices  and  a  high  volume  of 
traffic  preventing  it.   This  is  true  even  on  streets 
equipped  with  progressive  signal  systems.   With  proper 
traffic  engineering  controls  and  an  adequate  design,  how- 
ever, a  reasonable  average  running  speed  for  this  type  of 
facility  can  be  obtained. 

Average  running  speed  data  are  widely  utilized  to 
determine  benefits  gained  by  improvement  of  the  quality  of 
traffic  service.   This  has  been  accomplished  by  performing 
speed-delay  studies  both  before  and  after  implementation 
of  improvements  and  contrasting  the  running  speeds. 
Examples  of  areas  where  this  has  been  done  include  measuring 


"the  overall  effectiveness  of  traffic  control  devices",  in 
general  (10),  and  traffic  signal  timing  revisions,  in 
particular,  and  modifications  concerning  parking  prohibitions 
development  of  one-way  street  networks,  etc.   In  presenting 
such  accrued  benefits  of  savings  in  time  (or  its  monetary 
value)  and  a  usual  concurrent  reduction  of  accidents,  the 
engineer  has  shown  the  taxpayer,  as  well  as  the  politician, 
the  value  of  traffic  engineering  improvements. 

The  goal  of  this  study  has  been  to  determine  a  means 
of  predicting  the  average  speed  on  sections  of  urban 
arterials  without  performing  speed-delay  studies.   This 
was  accomplished  by  developing  a  model  for  predicting  the 
average  running  speed,  employing  experimentation  and 
analysis  similar  to  Hejal  (7) .   By  estimating  average 
speeds,  the  time  and  manpower  requirements  are  reduced,  as 
well  as  the  cost.   Indirectly,  the  major  causes  of  speed 
reduction  and  their  relative  influence  have  also  been 
determined. 

Past  Research  In  This  Area 
During  the  past  decade  there  has  been  some  research 
in  the  travel  time  or  speed  and  delay  areas.   Basically, 
three  different  approaches  have  been  tried.   They  are: 

1.  Computer  Simulation 

2.  Before  and  After  Studies 

3.  Models  -  Theoretical  and  Empirical 


Computer  simulation  appears  to  be  promising;  however, 
this  approach  has  not  been  perfected  to  the  stage  where 
the  results  are  both  workable  and  representative  of  reality. 
There  are  computer  solutions  for  parts  of  the  problem,  such 
as  predicting  speeds  of  vehicles  passing  through  signalized 
intersections  (1)  and  determining  delaying  effects  of  curb 
parking  maneuvers  on  traffic  (19).   Though  these  are  steps 
in  the  needed  direction,  the  latter,  as  well  as  others,  did 
not  produce  results  representative  of  reality.   It  is 
probable  that  the  development  of  a  realistic  simulation 
program  that  will  encompass  all  aspects  of  traffic  friction 
is  not  in  the  immediate  future.   The  development  of  an 
empirical  procedure,  however,  appears  to  be  a  quickly 
attainable  goal  since  the  formulation  period  would  be 
short  and  the  results  might  be  easily  applicable  and 
reasonably  accurate. 

Edwards  and  Kelcey  (3)  reviewed  the  many  parameters 
of  frictional  effects  on  travel  time.   There  were  many 
variables  involved  in  their  work  in  downtown  areas.   The 
procedure  that  was  followed  was  to  do  "before  and  after 
studies"  for  determining  the  quantitative  effect  of 
individual  improvements.   These  were  statistically  tested 
for  significance  and  a  Network  Assignment  Model  was  applied 
to  determine  the  overall  efficiency  of  the  network.   The 
method  worked  quite  effectively,  but  it  has  the  disadvantage 


of  being  very  expensive.   Such  a  method,  however,  used 
outside  the  Central  Business  District  (CBD)  where  fewer 
variables  would  have  to  be  measured  and  analyzed  might  be 
economically  reasonable. 

Another  researcher  (5)  included  change  of  speed  in 
route  evaluation  by  correlating  "cost,  comfort,  and  safety" 
with  a  number.   The  number,  however,  also  was  based  on 
change  of  direction  and  change  of  time. 

N.  =  A  time  x  A  speed  x  A  direction 

The  resulting  number  does  not  mean  anything  by  itself;  it 
must  be  compared  with  another  to  show  relative  improvement 
along  a  particular  section.   Unfortunately,  this  method 
lumps  all  the  influential  factors  of  quality  of  service 
together.   The  analyzer  has  no  way  of  determining  the 
cause  of  quality  deterioration,  whether  it  is  due  to 
traffic  volume  or  the  facility's  design  characteristics. 

Finally,  in  Hejal's  report  (7)  a  regression  analysis 
was  used  to  obtain  a  relationship  between  six  traffic 
parameters  and  the  running  speed  for  a  two-lane  rural  high- 
way.  Data  were  collected  using  the  average  car  method, 
noting  the  causes  of  interference  and  including  other 
factors  or  variables  from  a  road  inventory.   The  results 


obtained  compared  well  with  reality  and  at  the  same  time 

2 
had  a  high  coefficient  of  multiple  determination  (R  ) .   Thi 

method  proved  to  be  both  economical  and  simple  to  perform. 


PURPOSE 

The  specific  purpose  of  this  research  was  to  develop 
a  model  for  predicting  the  average  running  speeds  of 
vehicles  on  arterial  streets  in  fringe  areas  of  the  CBD. 
This,  it  was  assumed,  could  be  achieved  by  investigating 
the  influence  of  traffic  volume,  traffic  controls,  and 
design  features  of  the  street.   This  model  has  two  functions 
one,  to  predict  the  speed  that  an  average  vehicle  will 
travel  along  a  facility;  and  two,  to  evaluate  selected 
causes  of  speed  reduction. 

It  is  hoped  that  the  proposed  model  will  be  useful  on 
all  arterial  streets,  both  existing  and  proposed.   On 
existing  streets  it  should  emphasize  to  the  engineer  where 
improvements  should  be  made  and  their  relative  benefit. 
On  proposed  streets  it  should  indicate  during  the  design 
stage  the  average  running  speed  of  the  vehicles  that  will 
use  it  under  anticipated  conditions  and,  if  this  speed 
proves  to  be  too  low,  where  design  changes  need  to  be  made. 
Another  possible  use  will  be  the  determination  of 
appropriate  speed  limits. 

The  engineer  today  can  estimate  a  desired  quality  of 
service  from  an  evaluation  of  the  capacity  of  a  facility. 
The  results  of  this  research  will  be  another  tool, 


hopefully  one  which  is  easier  and  more  accurate  of 
application.   The  following  statement  taken  from  the 
"Highway  Capacity  Manual-196  5"  (9)  is  a  warrant  for  conduct 
of  this  study:   "Insufficient  data  are  available  to  attempt 
to  develop  correction  factors  (on  capacity  and  speed)  for 
such  mid-block  influences  as  curb  parking."   It  was 
apparent  that  more  research  was  needed  on  factors  affecting 
average  speed. 


DATA  COLLECTION 

Determination  of  Sample  Size  for  Speed  Measurements 
At  the  outset  of  the  data  collection  stage  of  the 
study  a  section  sample  size  had  to  be  chosen.   This  sample 
would  consist  of  a  number  of  running  speed  observations 
along  one  section  of  roadway  in  one  direction.   The 
decision  was  made  to  calculate  the  size  using  a 
statistical  approach  (more  observations  would  be  an 
inefficient  use  of  both  time  and  money) . 

Before  the  calculation  of  the  sample  size  could  be 
completed  certain  assumptions  had  to  be  made.   First,  it 
was  assumed  that  the  running  speeds  recorded  on  the  test 
sections  would  be  normally  distributed.   If  this  were  so 
then  the  following  formula  could  be  used  for  the 
calculation: 

n  =  z2   a2  /  E2     (14) 
ft 
2 

where 

n  =  sample  size 

z  =  table  value  based  on  the  chosen  confidence  level 

a  =  standard  deviation  of  the  population 

E  =  acceptable  error 

The  value  of  ot,,  the  probability  of  a  Type  I  error  chosen 

was  0.05  and  the  value  of  E  chosen  was  +  2.0  mph .   The 


value  used  for  the  standard  deviation  of  the  population, 
a,  had  to  be  determined  from  a  pilot  study  since  there 
were  no  recent  data  available. 

The  pilot  study  was  conducted  in  West  Lafayette, 
Indiana  on  Northwestern  Avenue  (U.S.  52)  between  Stadium 
Avenue  and  Lindberg  Road,  a  distance  of  approximately  1.1 
miles.   Data  were  collected  for  two  directions  on  three 
separate  Tuesdays  in  three  months  in  1970.   The  license 
plate  method  of  data  collection  was  used  during  the  early 
afternoon  of  each  day.   A  fifteen  minute  period  was 
allocated  to  each  direction  for  collection,  providing 
between  thirteen  and  twenty-five  pieces  of  usable  data 
for  analysis. 

The  analysis  showed  that  the  standard  deviation 
ranged  from  1.669  to  4.851  (requiring  sample  sizes  from 
three  to  twenty-two) .   However,  the  majority  of  the  values 
were  centered  around  3.2,  producing  a  sample  size  of  ten. 

It  was  at  this  point  that  a  homogeneity  of  variance  test 

^   2 
was  conducted  to  determine  whether  the  six  values  of  o 

were  equal.   A  Statistical  Program,  DATASUM  which  includes 

the  Foster-Burr  Test  (Q-Test)   was  used  to  determine  this 

(See  Appendix  A  for  description  of  this  test) .   The  data 

passed  the  Foster-Burr  Test.   Hence,  it  was  decided  to 

accept  the  hypothesis  that  the  variances  (and  standard 

deviations)  were  equal.   A  sample  size  of  ten  was  then 

chosen  for  use  in  data  collection. 


Selection  of  Study  Sections 

Once  the  sample  size  for  each  section  had  been  chosen 
(ten  observations)  it  was  then  necessary  to  determine  the 
number  of  cities  and  the  number  of  sections  in  each  city 
to  be  examined. 

At  the  outset  of  this  research  project  it  was  decided 
that  the  work  should  be  limited  to  cities  whose  population 
ranged  from  thirty  to  one  hundred  and  fifty  thousand 
people.   There  were  two  reasons  for  this  decision:  economy 
and  expedience.   Economy  refers  to  conserving  the  limited 
resources  of  both  time  and  money.   This  was  achieved  by 
restricting  the  data  collection  to  nine  Indiana  cities  in 
order  to  limit  the  resources  expended  in  traveling  between 
cities.   Cities  with  populations  less  than  thirty  thousand 
were  not  included  because  the  characteristics  of  their 
arterial  streets  are  considerably  different  than  such 
streets  in  larger  cities,  i.e.  the  fringe  area  is  very 
small  or  non-existent,  homogeneous  street  sections  are 
very  short,  and  traffic  engineering  is  often  not  used.   The 
ground  for  not  examining  cities  over  one  hundred  and  fifty 
thousand  population  is  that  they  are  too  few  in  number 
(Table  1) .   According  to  Table  1  eighty-five  percent  of  all 
Indiana  cities  with  populations  greater  than  thirty 
thousand  fall  within  the  specified  range.   At  the  national 
level  the  figures  are  just  as  impressive:  eighty-three 
percent  of  all  cities  over  twenty-five  thousand  have 
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TABLE  1 
POPULATION  GROUPINGS  OF  CITIES  IN  U.S.A.  AND  INDIANA 


U.S.A. 
(1960  Census) 

Number  of  cities  with 
populations  >  10000     1899 

Number  of  cities  with 
populations  >  25000     765 


Number  of  cities  with 

populations  between 

25000  and  250000        714 

Number  of  cities  with 

populations  between 

25000  and  100000        633 


Indiana 
(1970  Census) 

Number  of  cities  with 
populations  >  30000       20 

Number  of  cities  with 

populations  between 

30000  and  150000         17 
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populations  between  twenty-five  and  one  hundred  thousand, 
while  ninety-three  percent  of  all  cities  over  twenty-five 
thousand  have  populations  between  twenty- five  and  two 
hundred  and  fifty  thousand. 

In  order  to  economize  limited  resources  and  to  aid 
some  of  the  smaller  cities  of  the  state  by  supplying  them 
with  study  data,  it  was  decided  to  collect  all  data 
within  the  state  of  Indiana.   After  organizing  the  seventeen 
cities  that  were  included  in  the  specified  range  into  the 
three  classes  it  was  found  that  the  second  and  third 
classes  each  contained  three  cities.   The  decision  was 
then  made  to  use  three  cities  from  the  first  class,  there- 
by creating  nine  cities  to  be  investigated  (Table  2) .   In 
Class  I  the  preference  of  one  city  over  another  included 
consideration  of  its  proximity  to  Purdue. 

Each  section  investigated  was  observed  under  two 
traffic  conditions.   The  first  was  the  afternoon  off-peak 
and  the  second  was  the  evening  peak  period.   The  evening 
peak  was  chosen  instead  of  the  morning  because  it  tends  to 
be  better  defined  and  greater  in  magnitude.   However, 
using  these  two  conditions  covers  most  of  the  spectrum  of 
problem  traffic  conditions  except  the  possible  situations 
where  a  two-way  street  carries  high  volumes  in  both 
directions  during  the  noon  lunch  hour. 

The  judgment  was  made  to  research  ten  sections  in  each 
city  to  provide  a  sizeable  number  of  observations  at  the 
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TABLE  2 
STUDY  CITIES  AND  THEIR  CLASSIFICATION 

Class  Population  Range  City 

I  30000  -  49999  Lafayette 

Kokomo 
Marion 

II  50000  -  99999  Anderson 

Muncie 
Terre  Haute 

III  100000  -  149999  Hammond 

South  Bend 
Evansville 
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completion  of  data  collection. 

The  choice  of  streets  actually  studied  was  accomplished 
by  visual  inspection.   In  making  this  judgment  three  factors 
were  employed.   The  first  was  location  within  the  city. 
This  was  restricted  to  the  fringe  area  of  the  Central 
Business  District.   The  second  criterion  was  section  length. 
In  chosing  the  test  sections  an  attempt  was  made  to 
maintain  a  minimum  length  of  one  half  mile;  however,  this 
proved  quite  difficult  at  times  due  to  the  third  selection 
factor,  homogeneity.   Each  street  used  had  to  be  uniform 
along  the  length  analyzed  in  the  following  four  aspects: 

1.  Street  Type  (i.e.  one-way  or  two-way) 

2.  Parking 

3.  Approach  Width 

4.  The  Absence  of  STOP  and  YIELD  Signs  in  the 
Section 

For  the  most  part,  satisfying  the  above  conditions, 
with  the  exception  of  length,  was  not  exceedingly  difficult. 
Of  the  ninety  sections  investigated  sixty-three  percent 
were  one-half  mile  or  longer  while  slightly  over  eighty- 
three  percent  were  0.45  miles  or  longer  (Figure  1).   In 
the  cases  where  the  section  length  was  less  than  four- 
tenths  of  a  mile  (only  seven  of  ninety  sections)  the 
number  of  major  causes  of  speed  reduction,  e.g.  traffic 
signals  and  railroad  crossings,  was  kept  to  a  minimum 
(usually  one)  in  order  to  guarantee  that  the  test  vehicle 
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could  cruise  and  not  merely  accelerate  and  decelerate. 
Refer  to  Table  3  for  a  listing  of  the  streets  selected. 

Although  ten  sections  of  arterial  streets  were 
desired  in  each  of  the  nine  cities,  only  eight  adequate 
sections  could  be  found  in  Marion.   As  a  consequence  an 
additional  two  sections  were  used  in  Kokomo,  a  similar 
sized  city  in  the  same  area  of  the  state. 

Although  the  posted  speed  limits  of  the  sections 
ranged  from  20  mph  to  40  mph ,  less  than  eight  percent  of 
the  streets  had  speeds  different  than  30  mph.   The  average 
speeds  on  those  streets  with  a  posted  speed  less  than 
30  mph  were  found  to  be  about  thirty  miles  per  hour,  while 
those  streets  with  a  posted  speed  greater  than  30  mph  were 
approximately  thirty-five  miles  per  hour. 

During  the  off-peak  traffic  condition  observations 
were  taken  on  all  ninety  sections;  however,  during  the 
peak  traffic  condition  observations  were  taken  on  only 
sixty-two  sections.   This  was  a  result  of  the  absence  of 
a  peak  volume  on  some  streets  during  the  evening  peak 
period.   In  some  cases  the  section  was  a  one-way  street 
of  a  pair,  while  in  others  it  was  one  direction  of  a  two- 
way  street.   As  a  consequence,  of  the  one  hundred  and 
eighty  conditions  that  could  possibly  be  investigated  only 
one  hundred  and  fifty-two  actually  were  analyzed. 
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Data  Collection  Procedures 
The  data  that  were  collected  for  this  project  may  be 
divided  into  three  general  categories:   1)  speed-delay 
data,  2)  volume  data,  and  3)  section  inventory  data.   A 
section  refers  to  either  a  one-way  street  or  one  direction 
of  a  two-way  street. 

The  initial  step  of  research  in  a  city  was  the 
obtainment  of  a  city  street  map  and  the  labeling  of  the 
perimeter  of  the  Central  Business  District  (CBD)  and  the 
arterial  streets.   This  was  usually  done  by  contacting  the 
City  Traffic  Engineer;  however,  in  some  cases  it  was 
necessary  to  speak  to  either  the  Police  Traffic  Officer  or 
the  City  Engineer.   This  also  proved  to  be  a  good 
opportunity  to  speak  to  either  the  Chief  of  Police  or  the 
Police  Traffic  Officer  and  inform  them  of  the  type  of 
study  that  was  to  be  done. 

Once  the  preliminaries  were  complete  the  actual  data 
collection  began.   This  required  driving  to  the  CBD 
perimeter  and  checking  each  arterial  for  length  and  homo- 
geneity.  The  former  was  done  using  a  vehicle  equipped 
with  a  calibrated  Stewart-Warner  Survey  Speedometer 

(Odometer)  while  the  latter  was  accomplished  by  answering 
the  pertinent  questions  of  the  Inventory  Questionnaire 

(Figure  2).   The  remainder  of  the  questionnaire  was 
completed  at  the  researcher's  convenience. 


Inventory  Questionnaire 
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Check  the  appropriate  spaces 

City 

Street  Name 
Type  of  Street 
Parking 


Number  of  Railroad  Crossings 

Number  of  Driveways 

Section  Direction 

Approach  Width 

Lane  Markings 

Section  Direction 

Approach  Width 

Lane  Markings 

Number  of  Signalized 
Intersections 

Number  of  Intersections 
Other  than  Signalized 

Number  of  Alleys 

G/C  Ratios 

Intersecting  Street 


One-Way  Two-Way  

No  Parking  

Parking  (one  side)  

N  E  S  W  side  of  street 
Parking  (both  sides)  


Commercial 
North 


Private 
East 


Yes 


NO 


South 


West 


Yes 


No 


G/C  Ratio 


FIGURE  2.   INVENTORY  QUESTIONNAIRE 


23 


After  a  street  was  chosen,  Streeter  Amet  Traf f icounters 
(Models  RC  or  RCT)  were  installed  in  each  direction.   These 
pneumatic  tube  traffic  volume  counters  were  placed  so  that 
the  volume  that  passed  that  point  was  representative  of 
what  travelled  on  the  whole  section. 

Upon  completion  of  this  phase  a  speed-delay  study  was 
begun,  using  the  average  car  method.   In  brief,  this  method 
involved  one  of  the  researchers  driving  the  test  vehicle 
along  the  section  at  what  he  considered  the  average  speed 
of  the  traffic  stream  while  simultaneously  measuring, 
using  a  stop  watch,  the  total  time  of  the  trip.   At  the 
same  time,  a  recorder  timed  all  stops  and  noted  both  these 
and  other  forms  of  delay.   The  data  sheet  completed  by  the 
recorder  is  shown  in  Figure  3.   Upon  returning  to  the 
office  the  researcher  completed  the  data  sheet  and 
proceeded  to  complete  the  Summary  Sheet  (Figure  4). 

This  procedure  was  followed  on  each  run  with  ten  runs 
performed  under  two  traffic  conditions  on  each  section, 
during  the  off-peak  period  and  again  during  the  evening 
peak  period.   To  determine  the  time  and  magnitude  of  the 
peak  period  a  traffic  volume  counter  was  left  in  position 
for  twenty-four  consecutive  hours  between  Monday  and 
Friday.   However,  this  method  had  to  be  modified  on  many 
sections  due  to  equipment  malfunction  or  damage  to  the 
pneumatic  tubing  by  street  cleaning  equipment.   The 
modification  applied  was  the  installation  of  the  traffic 
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TRAVEL  TIME  STUDY 


ROUTE 


TRIP  STARTED  AT 
TRIP  ENDED  AT 


TRIP  NO. 


DIRECTION 
HRS 


LOCATION 


HRS 


LOCATION 


MILEAGE 


MILEAGE 


REFERENCE 

STOPS 

SLOWS 

POINT 

TIME 

LOCATION 

CAUSE 

DELAY 
SEC. 

LOCATION 

CAUSE 

TOTAL  TRIP  TIME 

RUNNING  TIME  

DATE 


TOTAL  TRIP  LENGTH 
STOPPED  TIME 


OVERALL  SPEED 
RUNNING  SPEED 


WEATHER 


FIGURE  3.   DATA  SHEET  FOR  TRAVEL  TIME  STUDY 
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Route 


Between 
Date 


Time 


Summary  Sheet 

Length 

And 


AM 
PM  to 


AM 

PM  Weather 


Major  Cause  for  Delay 

Trip 
No. 

No. 

Times 

Stopped 

Total 

Sec. 

Stopped 

No. 

Times 

Slowed 

Ave- 
rage 
Speed 

TOTALS 

AVERAGES  PER  TRIP 

Date 


Compiled  By 


FIGURE  4.   SUMMARY  SHEET  FOR  TRAVEL  TIME  STUDY 
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volume  counter  during  the  late  morning  hours  and  the 
removal  of  it  the  same  day  during  the  early  evening  hours. 

If  this  type  of  study  is  to  be  a  success  the  people 
being  observed  must  not  know  what  is  taking  place.   The 
test  vehicle  is  important  in  this  matter.   The  vehicle 
used  for  the  data  collection  was  a  1962  Chevrolet  four 
door  sedan;  in  no  way  was  the  vehicle  conspicuous. 

The  Inventory  Questionnaire  (Figure  2)  was  the  only 
original  form  used  in  the  study,  the  others  being  standard 
forms  from  the   Manual  of  Traffic  Engineering  Studies   (10) 
It  functioned  as  a  substitute  for  a  condition  diagram  of 
the  section  using  both  qualitative  and  quantitative 
questions.   All  the  required  measurements  could  be 
completed  by  one  person  with  the  exception  of  that  of  the 
approach  width.   Usually  three  width  measurements  were 
taken,  one  at  each  end  and  one  in  between,  using  a  metallic 
tape.   The  G/C  ratios  were  usually  supplied  by  the  City 
Traffic  Engineer;  however,  in  cases  where  they  were  not, 
the  measurements  were  made  and  the  percentages  rounded  to 
the  nearest  integral  number. 

An  inherent  problem  in  the  average  car  method  arose  in 
data  collection.   During  off-peak  measurements  there  were 
periods  when  the  test  vehicle  was  not  included  within 
visible  range  of  other  vehicles.   The  driver  of  the  test 
vehicle  then  had  to  make  the  decision  of  choosing  a  speed 
that  he  felt  was  average.   To  wait  outside  the  section  for 
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other  vehicles  usually  does  not  alleviate  the  problem  -  all 
too  often  under  low  volume  conditions  a  group  of  other 
vehicles  does  not  materialize.   It  was  sometimes  difficult 
under  such  conditions  for  the  driver  of  the  test  car  to  be 
certain  he  drove  at  the  speed  of  the  average  vehicle. 
The  bulk  of  the  data  collection  was  done  between 
February  and  May  of  1971  and  only  during  dry  weather 
conditions.   It  should  also  be  noted  that  the  data  was 
collected  on  weekdays,  with  the  exception  of  one  Saturday 
afternoon. 
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DATA  ANALYSIS 

Original  Variables 

Table  4  is  a  listing  of  the  variables  used  in  the 
development  of  the  model.   Of  the  twenty-two  variables 
investigated,  only  eleven  are  actually  included  in  the 
proposed  model.   It  should  also  be  noted  that  one  of  the 
twenty-two  variables,  Driver,  was  added  just  prior  to  the 
choosing  of  the  model.   This  was  done  to  try  to  improve 
the  model,  but  which  it  did  to  only  a  very  small  degree. 

Information  for  many  of  the  variables  was  obtained 
directly  from  the  Inventory  Questionnaire.   Data  for  the 
twenty  first  variable,  Progression,  however,  resulted 
from  value  judgments  on  the  part  of  the  researcher.   Value 
judgments  present  difficulties  because  other  researchers 
might  have  made  different  decisions.   In  the  case  of  a 
section  where  there  were  zero  or  one  signalized  inter- 
section, the  section  was  considered  to  have  good 
progression.   Where  the  section  had  more  than  one  signalized 
intersection,  the  researcher  on  the  basis  of  speed  progression 
actually  obtained  during  the  runs  made  the  judgment  as  to 
whether  or  not  good  progression  was  present. 

The  twenty-second  variable,  Driver,  represents  whether 
Driver  A  or  Driver  B  was  at  the  wheel  of  the  test  vehicle. 
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TABLE  4 
VARIABLES  TESTED  IN  DEVELOPMENT  OF  THE  MODEL 


Variable 
Number 


3(A) 
4 


5(B) 


Variable 
Name 

Running 
Speed 


Peak 


Section 
Length 

Street 
Type 


Approach 
Width 


6 

Lane 

Marking 

7(C) 

Signalized 

Inter. 

8(D) 

Unsignal- 

ized  Inter 

9(E) 

Alleys 

10 

Private 

Driveways 

11 

Commercial 

Driveways 

Description  and  Information  Coded 

The  length  of  the  section  divided 
by  the  time  in  motion  for  one  trip 
along  that  section  (mph) . 

Denotes  whether  measurements  were 
done  during  the  P.M.  Peak  period 
or  not. 
Off-Peak  =  0 
P.M. Peak  =  1 

The  length  of  the  section  in  miles 


Denotes  whether  the  section  is  a 
one-  or  two-way  street. 
One-way  =  0 
Two-way  -  1 

The  curb  to  curb  width  (face  of 
curb)  in  feet  for  one-way  streets 
and  the  curb  to  centerline  width 
in  feet  for  two-way  streets. 

Used  for  multilane  facilities  only. 
No  =  0 
Yes  -  1 

The  number  of  signalized  inter- 
sections included  in  the  section. 

The  number  of  unsignalized  inter- 
sections included  in  the  section. 

The  number  of  alleys  included  in 
the  section. 

The  number  of  private  driveways 
included  in  the  section. 

The  number  of  commercial  driveways 
included  in  the  section. 
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TABLE  4  (Continued) 


Variable 
Number 

12 


Variable 
Name 

Day 


Description  and  Information  Coded 

The  day  of  the  week  that  the  data 
was  collected. 
Sun  =  1 
Mon  =  2 


13(F) 


14(G) 

15 
16 
17(H) 


Pop. 

Railroad 
Crossings 

Parking 

Parking 

Parking 


18  (I) 


19  (J) 


20 


Volume 


Lowest 
G/C  ratio 


Average 
G/C  ratio 


Sat  =  7 

Denoted  the  population  class 
30000-49999  =  0 
50000-99999  =  1 
100000-149999  =  2 

The  number  of  individual  crossings 
(not  the  number  of  tracks)  included 
in  the  section. 

Variables  15,  16,  and  17  are 
Parking  dummy  variables  whose 
relationship  is: 
15    16    17 

0  0     0   Parking  on  both  sides 

of  the  section 

1  0    0   No  parking  on  either 

side  of  the  section 
0     10   Parking  on  the  left 

side  of  the  section 

(one-way  streets) 
0     0     1   Parking  on  right  side 

of  the  section. 

The  fifteen  minute  volume  which  was 
concurrently  measured  with  the 
observation  (one  trip) . 

The  lowest  G/C  ratio  of  all  the 
signalized  intersections  included 
in  the  section. 

The  average  of  all  the  G/C  ratios 
included  in  the  section. 
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TABLE  4  (Continued) 


Variable 
Number 

21(K) 


22 


Variable 

Name      Description  and  Information  Coded 

Good        A  value  judgment  of  whether  good 
Progression  progression  existed  in  those 

sections  which  contained  two  or 
more  signalized  intersections. 
No  =  0 
Yes  =  1  (also  used  for  sections 

with  zero  or  one  signalized 
intersection) 


Driver 


Denotes  driver  of  the  test  vehicle 
Driver  A  =  0 
Driver  B  =  1 


Range  of  Variable  Measurements 


Variable 
Number 

3 

7 

13 
14 
18 
19 
20 


Range 

0.298  miles  -  1.199  miles 

0  traffic  signals  -  6  traffic 
signals 

30000  people  -  150000  people 

0  crossings  -  2  crossings 

24  vehicles  -  354  vehicles 

25%  -  100% 

25%  -  100% 
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Upon  examination  of  the  covariance  matrix  it  was  found 
that  one  driver  drove,  on  the  average,  nine-tenths  of  a 
mile  per  hour  faster  than  the  other. 

A  listing  of  the  limitations  of  the  individual 
variables,  where  this  is  relevant,  is  given  at  the  end  of 
Table  4. 

Preliminary  Testing 

As  data  collection  progressed  it  was  felt  necessary 
to  determine  if  the  data  indicated  a  homogeneous  population, 
To  accomplish  this  the  Q-Test  contained  in  Statistical 
Program  DATASUM  was  again  employed.   Table  5  is  a  summary 
of  these  results,  including  the  results  of  the  tests  made 
at  the  completion  of  the  data  collection  stage.   The  tests 
made  during  data  collection  indicated  homogeneity  while 
those  made  upon  completion  of  data  collection  indicated 
heterogeneity.   A  possible  reason  for  the  failures  for 
the  data  at  completion  might  be  the  non-normality  (see 
Appendix  B)  and  not  the  heterogeneity  of  the  data  (4). 

At  this  stage  of  the  analysis  it  was  realized  that 
interaction  between  the  variables  was  quite  high  and  an 
attempt  was  made  to  determine  it,  using  two-way  ANOVA  tests. 
Unfortunately,  due  to  the  incompleteness  of  the  cells,  it 
was  found  impossible  to  perform  these  tests.   Therefore, 
regression  analysis  was  employed,  using  cross  products  to 
investigate  the  interaction. 
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TABLE  5 
SUMMARY  OF  HOMOGENEITY  TEST  RESULTS 

Number  of  Variances  Q-Test 

Tested  (0.05  Level) 

(Tests  Made  During  Data  Collection) 

6  Passed 

12  Passed 

16  Passed 

20  Passed 

24  Passed 

44  Passed 

(Tests  Made  Upon  Completion  of  Data  Collection) 

62  (Peak)  Failed 

90  (Off-Peak)  Failed 

152  (Total)  Failed 
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Stepwise  Regression 
Towards  the  completion  of  the  data  collection  phase, 
investigation  of  Statistical  Program  BMD  2R  (Stepwise 
Regression)  (17)   began.   Using  three  hundred  and  sixty 
observations,  different  F-ratios  and  Tolerances  were 
examined  for  deviation  between  models  created.   Due  to  a 
lack  of  deviation  in  the  models  the  following  values  were 
chosen  and  utilized  throughout  the  analysis:   F-ratio  = 
1.75  and  Tolerance  =  0.001. 

Phase  1 
Using  a  sample  of  six  hundred  observations  and 
containing  only  the  first  seventeen  variables  (see  Table  4) , 
the  input  for  a  model  was  created.   Nine  of  these  variables 
were  chosen  for  use  in  forming  transgenerated  variables, 
e.g.  squares  and  cross  products.   These  nine  variables  were: 

1.  Section  Length 

2 .  Approach  Width 

3.  Signalized  Intersections 

4.  Unsignalized  Intersections 

5.  Alleys 

6.  Railroad  Crossings 

7.-9.   Parking  Dummy  Variables 
Those  transgenerated  variables  which  were  either  removed 
from  the  model  during  its  development  or  never  entered 
because  of  low  F-ratios  were  eliminated  from  further 
consideration. 
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Soon  afterwards  another  two  hundred  and  ten 
observations  were  added  and  the  process  continued.   The 
logarithms  of  some  of  the  original  variables  were  also 
examined  during  this  phase  of  the  regression  analysis. 
Later  another  three  hundred  observations  were  added  to  the 
model's  input. 

Phase  2a 
Phase  2a  was  an  attempt  at  utilizing  the  most 
significant  variables  found  from  analyses  of  the  eight 
hundred  and  ten  observations  and  the  eleven  hundred  and 
ten  observations  of  the  previous  phase.   At  the  same  time 
variables  concerning  volume  and  traffic  signal  phasing 
were  added.   The  input  for  the  Phase  2a  models  is 

diagrammatically  shown  in  Figure  5 .   At  the  completion  of 

2 
Phase  2a  the  R  value  was  0.6679  for  a  model  containing 

twenty-nine  variables  (original  and  transgenerated) . 

Phase  2b 
Simultaneous  to  the  running  of  the  Phase  2a  models 
Phase  2b  was  attempted.   The  Phase  2b  input  is  shown  in 
Figure  5.   The  function  of  this  phase  was  to  check  whether 
or  not  the  three  hundred  additional  observations  of  the 
eleven  hundred  and  ten  observation  model  had  caused  certain 
variables  to  become  more  significant.   At  the  termination 
of  Phase  2b  an  R  value  of  0.6919  had  been  reached  using 
thirty-six  variables.   Because  the  final  Phase  2b  model 
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appeared  more  promising  than  its  Phase  2a  counterpart,  it 
was  decided  to  enter  Phase  3a  using  the  most  significant 
variables  from  the  Phase  2b  model. 

Phase  3a 
Phase  3a  was  based  on  the  same  input  as  Phase  2b 
except  for  the  addition  of  the  more  significant  trans- 
generated  variables  entered  during  Phase  2b  and  the 
reciprocals  of  some  of  the  variables.   These  reciprocals 
were  also  used  to  form  new  cross  products.   Although  many 

new  transgenerated  variables  were  examined  the  value  of 

2 
R  remained  relatively  unchanged. 

Phase  4a 
Phase  4a  was  the  stage  in  which  illogical  trans- 
generated  variables  were  removed  from  the  model.   In 
addition  to  the  twenty-one  original  variables,  another  ten 
logical  (logical  in  individual  effect)  variables  were 
selected  from  the  most  significant  transgenerated  variables 
entered  into  each  of  five  Phase  3a  models.   It  was  from 
this  phase  that  the  final  prediction  model  was  developed. 

The  Prediction  Model 
The  model  that  was  chosen  contains  ten  terms  and 

utilizes  eleven  original  variables.   It  represents  the 

2 
Phase  4a  model  where  each  term  increased  the  value  of  R  by 

approximately  one  percent.   The  model  is  shown  below  in 
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test  format,  followed  by  the  Summary  Table  from  the  step- 
wise regression  program  (Table  6). 


Y  =  19.62500  +  10.38018X3  +  0.07562X5  -  0.70139X? 

-  0.22178Xg  +  1.25800X13  -  0.03118Xlg  +  7.38855X19 

-  0.18590Xo_  -  0.24199X-,  +  0.01815X-,. 

22  23  24 


where 


=  predicted  average  running  speed 


Y 

s 

predic 

X22 

= 

X8X14 
X3 

X23 

= 

X9X17 
X3 

X24 

= 

X18X21 

See  Table  4  for  definitions  of  all  other  variables. 

In  Chapter  3  of  Draper  and  Smith  (2) ,  "The  Examination 

of  Residuals,"  it  states: 

We  can  see  that  ...  the  residuals  e.  are  the 
differences  between  what  is  actually  observed,  and 
what  is  predicted  by  the  regression  equation  -  that 
is,  the  amount  which  the  regression  equation  has 
not  been  able  to  explain.   Thus  we  can  think  of 
the  e.  as  the  observed  errors  if  the  model  is 
correct.   Now  in  performing  the  regression  analysis 
we  have  made  certain  assumptions  about  the  errors; 
the  usual  assumptions  are  that  the  errors  are 
independent,  have  zero  mean,  a  constant  variance, 
a^ ,  and  follow  a  normal  distribution. 

Appendix  C  contains  the  frequency  plot  of  the  residuals 

and  the  Kolmogorov-Smirnov  Test  for  normality.   The 

frequency  plot  looked  normal  and  the  test  accepted  the 
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hypothesis  at  a  significance  level  of  0.05.   The  mean  of 
the  residuals  was  zero.   To  test  the  independence  of  the 
residuals  two  different  approaches  were  used,  neither 
producing  favorable  results.   Each  approach  used  a 
different  ordering  of  the  "runs"  test  outlined  in  Draper 
and  Smith  (2).   The  first  ordering  was  a  time  approach. 
This  was  done  by  chronologically  ordering  the  residuals  of 
ten  randomly  chosen  workdays.   The  results  produced, 
however,  were  unsatisfactory.   The  ordering  of  the 
predictions  (from  lowest  to  highest)  and  their  corresponding 
residuals  was  the  second  approach.   Again,  the  results 
were  the  same. 

A  study  of  the  plot  of  the  residuals  and  the 
observations  (running  speeds)  showed  that  as  the,  speeds 
approached  the  limits  of  the  observation  range,  the 
residuals  tended  to  be  of  one  sign.   Therefore,  if  a 
chosen  speed  were  at  either  extremity  all  the  residuals 
would  be  of  one  sign,  independent  of  time  ordering. 
Because  all  speeds  taken  during  a  single  workday  were  on 
only  a  few  sections,  the  chosen  runs  on  a  single  workday 
(the  basis  of  this  ordering  approach)  tended  to  have 
similar  speeds,  sometimes  at  one  of  the  extremities. 

In  many  instances,  residuals  are  ordered  with  respect 
to  one  of  the  independent  variables.   This  was  not  possible 
in  this  study  because  the  variables  were  discrete  over  a 
narrow  range.   In  place  of  one  of  the  independent  variables, 
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the  prediction  Y  had  to  be  used.   This  approach  did  not 
prove  independence  because  the  ordering  from  lowest  to 
highest  provided  large  groups  of  residuals  of  similar  sign. 

Though  the  tests  to  determine  whether  or  not  the 
residuals  were  independent  failed,  there  may  be  good 
reasons  for  this  result.   The  test  used  is  most  sensitive 
to  independence  for  models  of  excellent  fit.   The  model 
tested  explains  about  sixty  percent  of  the  variation  of 
the  total  sums  of  the  squares  of  the  running  speed.   Finally, 
as  already  noted,  the  ordering  choices  for  which  the 

residuals  were  tested  presented  difficulties. 

2 

The  multiple  correlation  coefficient  (R  )  for  the  model 

is  0.5926.   There  are  three  basic  reasons  why  this  value 
is  not  higher.   The  first  reason  is  that  the  number  of 
terms  appearing  in  the  model  was  kept  to  a  minimum  for 

calculation  ease.   Adding  another  ten  terms  would  have 

2 

raised  the  value  of  R  by  only  another  six  percent  and  some 

of  these  were  not  individually  logical.   It  should  be 

2 
noted  that  the  highest  value  of  R  achieved  during  the 

analysis  was  0.7022  using  a  model  containing  forty-two 

terms. 

The  second  and  probably  the  most  important  reason  is 

the  variability  of  drivers  in  the  traffic  stream.   Many  of 

the  factors  that  influence  a  driver's  actions  are  known 

but  cannot  be  measured  either  quantitatively  or 

qualitatively.   To  give  some  examples:   1)  motivation 
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factors,  such  as  attitude;  2)  emotional  factors,  such  as 
anger  and  fear;  and  3)  other  modifying  factors,  such  as 
fatigue,  climate,  and  time  of  day.   Motorists  who  are 
fatigued  and  inattentive  drive  differently  than  they 
normally  would  and  some  may  even  try  to  compensate  for 
these  factors.   Unfortunately,  in  making  this  type  of 
study  the  researcher  has  great  difficulty  determining 
which  of  these  factors  are  influencing  (or  to  what  extent) 
the  drivers  in  the  traffic  stream  which  he  is  studying. 

The  final  reason  is  variability  due  to  the  method  of 
data  collection  -  the  average  car  method.   Late  in  the 
analysis  stage  the  question  of  variability  between  the  two 
test  drivers  arose.   The  twenty  second  variable,  Driver, 
was  then  created  to  examine  this.   First,  it  was  tested  in 
a  stepwise  regression  program  with  the  original  variables 
and  proved  to  be  more  important  than  some  of  them.   This 
new  variable  was  tested  with  the  terms  of  the  final  model. 

The  result  of  this  test  was  a  negligible  improvement  in 

2 

the  explanation  (R  );  therefore,  it  was  not  included. 

Another  reason  for  this  method's  variability  is  the  already 
discussed  difficult  driver  selection  of  an  average  speed 
under  low  volume  conditions. 

After  the  selection  of  the  model  the  decision  was 
made  to  test  it  using  part  of  the  data  collected.   Ten  of 
the  sections  were  chosen  from  the  data  by  randomly 
ordering  peak  and  off-peak  conditions,  then  randomly 
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ordering  the  section  numbers  and  matching  the  first  ten  of 

each.   The  model  was  tested  with  coefficients  to  five 

places  and  the  coefficients  rounded  to  three  places. 

Since  the  results  were  adequate  for  both  coefficient 

conditions  (Table  7) ,  it  was  decided  to  use  the  model  with 

rounded  coefficients  to  simplify  calculation. 

To  determine  the  accuracy  of  the  model  as  a  predictor, 

the  standard  deviation  of  each  prediction,  S~,  was 

Y 
estimated.   The  value  of  S^  could  be  calculated  using  an 

Y 
equation  similar  to  the  equation  for  the  one  independent 

variable  case,  shown  below: 


2    2 
S/s  =  s„ 

Y    E 


1  +  1  +  (X-X)2 


n    Z    xT 


where 

X  =  the  independent  variable 
x  *   X-X 

The  assumption  was  made  that  the  third  term  in  brackets 

was  small  and  it  was  known  that  1/n   (1/1520)  was  very 

small,  so  that 

s2  ~  S2 
Y     E 

The  standard  error  of  the  estimate  (3.417)  then  approximates 
the  actual  value  of  s~. 
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The  model  in  its  final  form  using  alphabetical 
subscripts  is  shown  below. 


Y  -  19.625  +  10.380XA  +  0.076Xfi  -  0.701XC 
-  0.222XD  +  1.258Xp  -  0.031XJ  +  7.389Xj 


-  0.186XDXG  -  0.242XEXR  +  CO^X^ 
A  A 


where 


XA=  X3 


XB"  X5 


xc  -  X? 


XD-  X8 


XE  =  X9 


XF  "  X13 


XG  "  X14 


XH  "  X17 


XI  =  X18 


XJ  "  X19 


XK  *  X21 


Though  eleven  variables  were  entered  into  the  model, 
eleven  others  were  not.   It  might  be  of  interest  to 
examine  some  of  them  now.   Proceeding  in  the  order  of 
significance,  the  next  variable  that  would  have  been 
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entered  was  Street  Type  (one-way  or  two-way)  the  coefficient 
of  which  was  illogical.   Surprisingly  enough,  the  variable 
which  denoted  whether  or  not  the  prediction  was  being  made 
for  the  peak  condition  entered  into  the  model  very  late  in 
the  analysis,  indicating  a  lack  of  significance,  and  had 
a  small  coefficient.   Another  variable  which  originally 
was  hypothesized  as  being  influential  was  Commercial 
Driveways.   This  variable  also  entered  the  model  late  in 
the  regression  analysis. 
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ANALYSIS  OF  STOP  DELAY 

The  prediction  of  average  running  speed  is  a  reasonable 
means  of  determining  and  evaluating  some  of  the  causes  of 
speed  reduction;  however,  to  justify  where  and  what  improve- 
ments should  be  made  it  is  usually  necessary  to  have 
knowledge  concerning  any  stop  delay  time.   This  section 
will  be  devoted  to  pointing  out  those  locations  where  stop 
delays  occurred  in  this  study  and  their  average  values. 
Equipping  the  traffic  engineer  with  both  a  running  speed 
prediction  model  and  average  stop  delay  data  will  enable 
him  to  estimate  where  improvements  should  be  made  and  the 
total  benefits  accrued  therefrom. 

A  major  part  of  this  study  was  devoted  to  the 
measurement  of  stop  delay,  i.e.  the  time  elapsed  while  the 
test  vehicle  was  at  a  complete  stop.   On  the  sections 
investigated  in  this  research  project  these  stop  delays 
were  caused  by  three  different  factors: 

1.  Traffic  Signals 

2.  Railroad  Crossings 

3.  Miscellaneous  Delays 

Utilizing  the  data  from  this  study,  it  was  found  that 
Traffic  Signals  had  the  greatest  influence  on  stop  delay 
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for  both  the  off-peak  and  peak  condition.   For  the  off-peak 
condition  the  percentage  of  the  total  stop  delay  attributed 
to  each  factor  was  as  follows:   81.90%,  Traffic  Signals; 
15.50%,  Railroad  Crossings;  and  2.60%,  Miscellaneous 
Delays.   During  the  peak  period  the  same  factors  had  values 
of  84.91%,  12.47%,  and  2.62%,  respectively. 

Miscellaneous  Delays  contains  any  other  causes  of 
stop  delay  except  by  traffic  signals  and  railroad  crossings; 
however,  during  this  study  it  was  produced  by  left  turns, 
right  turns,  school  crossings,  stopped  vehicles  in  the 
roadway,  and  pedestrians  crossing  the  roadway.   The  major 
and  most  common  delay  producing  element  of  this  factor  was 
the  left  turn  (36.80%  of  this  factor's  stop  delay  during 
the  off-peak  period  and  74.90%  during  the  peak  period). 

Tables  8  and  9  contain  the  delay  data  collected 
during  the  study.   Table  8  is  a  tabulation  of  the  off-peak 
data  while  Table  9  is  a  tabulation  of  the  peak  data.   The 
delays  recorded  are  the  averages  of  the  stop  delays 
encountered  during  the  ten  observations  along  each  section 
for  each  condition.   Within  Miscellaneous  Delays  the 
delays  have  been  presented  as  the  average  delay  per  mile 
as  opposed  to  the  localized  effects  of  Traffic  Signals  and 
Railroad  Crossings. 

A  listing  of  the  important  statistics  developed  from 
the  stop  delay  data  is  shown  in  Table  10. 
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The  delay  encountered  at  traffic  signals  was  found  to 
increase  as  the  number  of  signals  increased  and  as  the 
condition  changed  from  off-peak  to  peak.   Due  to  the  small 
number  of  observations  involving  sections  containing  four 
or  more  signals  analysis  was  limited  to  those  sections 
containing  one,  two,  and  three  signals.   During  the  off- 
peak  period  the  average  delay  caused  by  a  single  signal 
was  about  nine  seconds  while  the  delay  of  a  section  which 
contained  three  signals  was  found  to  be  approximately 
fourteen  seconds.   For  the  peak  condition  the  respective 
values  were  approximately  fourteen  and  one-half  seconds 
and  twenty-one  and  one-half  seconds.   If  each  condition  is 
examined  individually  it  is  found  that  as  the  number  of 
traffic  signals  per  section  increased  the  individual 
effect  of  each  decreased.   During  the  off-peak  period  the 
individual  effect  for  one,  two,  and  three  signals  is  9.13, 
5.95,  and  4.71  seconds,  respectively.   Similarly,  for  the 
peak  period  the  delays  are  14.62,  8.93,  and  7.18  seconds, 
respectively.   Intuitively  this  reflects  the  effect  of 
those  sections  which  exhibited  good  progression.   The 
standard  deviations  that  were  calculated  tend  to  emphasize 
the  effect  of  the  few  signals  where  the  delay  for  each 
observation  was  large  because  the  intersection  capacities 
were  exceeded.   It  was  at  these  intersections  that  the 
driver  waited  through  as  many  as  three  and  four  total 
cycles,  whereas  at  the  more  typical  traffic  signal  the 
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wait  was  only  through  a  portion  of  a  complete  cycle.   This 
situation  clearly  indicates  that  whenever  one  is  dealing 
with  the  condition  where  the  intersection  is  operating  at 
above  capacity,  a  stop  delay  study  at  the  particular 
location  will  be  necessary  to  obtain  accurate  estimates  of 
stop  delay  time. 

The  average  delays  encountered  in  sections  which 
contained  one  or  two  railroad  crossings  (not  tracks)  were 
also  calculated  (Table  10).   The  standard  deviations 
indicate  a  high  degree  of  variability  in  the  stop  delay. 
Though  a  railroad  crossing  may  exist,  the  number  of  times 
that  one  is  delayed  is  often  very  few.   This  is  dependent 
on  the  frequency  that  railroad  stock  uses  the  particular 
crossing.   Furthermore,  if  a  crossing  is  blocked,  the  stop 
delay  of  the  motorist  is  dependent  on  the  length  and  speed 
of  the  train.   In  some  cases  individual  delays  at  railroad 
crossings  in  this  study  were  as  long  as  four  or  five 
minutes.   To  generalize  about  the  magnitude  of  stop  delays 
at  railroad  crossings  is  virtually  impossible.   It  is 
realistic  to  conduct  individual  delay  studies  at  the 
particular  crossings  in  question  to  determine  these  data. 

The  delays  due  to  the  Miscellaneous  Delays  factor  for 
the  off-peak  and  peak  periods  were  both  very  small,  0.40 
and  0.68  seconds  per  mile,  respectively. 

The  application  of  estimated  average  stop  delay  values 
will  be  most  useful  in  determining  and  presenting  benefits 
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of  traffic  engineering  improvements.   The  average  values 
of  delay  found  in  this  study  when  utilized  according  to 
the  restrictions  noted  enables  an  estimate  of  the  benefits 
gained  by  particular  improvements,  e.g.  constructing  a 
parallel  route  with  fewer  traffic  signals,  the  prohibition 
of  left  turns  or  the  utilization  of  one  street  over  another. 
These  benefits  could  be  measured  in  terms  of  either  time 
or  its  chosen  money  equivalent.   The  findings  also 
emphasize  the  fact  that  the  number  of  traffic  signals 
should  be  kept  to  a  minimum  and  that  good  progression  is 
essential  to  minimizing  stop  delay  time.   A  third 
application  of  these  values  might  be  to  point  out  those 
sections  or  intersections  which  are  operating  above 
capacity  by  contrasting  actual  delays  with  the  average 
values  found  in  this  study. 
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CONCLUSION 

The  primary  objective  of  this  project  was  to  formulate 
a  model  for  predicting  the  average  running  speed  on  urban 
arterial  streets.   This  has  been  fulfilled  within 
specified  limits  of  city  size  and  area  within  the  city. 
From  this  project  comes  knowledge  of  the  interaction 
between  frictional  factors  influencing  the  motorist  and  of 
those  frictional  factors  that  most  affect  him. 

The  model  in  its  final  form  is: 


Y  =  19.625  +  10.380XA  +  0.076XB  -  0.701XC 
-  0.222X^  +  1.258XT,  -  0.031XT  +  7.389XT 


-  0.186XDXG  -  0.242X£XH  +  O.OISX^ 
A  A 


where 


X  =  Section  length  in  miles 

X0  =  Approach  width  in  feet 

X_  =  Number  of  signalized  intersections 

Xn  -   Number  of  unsignalized  intersections 

X_  ■  Number  of  alleys 

X  -  Population  class  (30,000  to  50,000  ■  0;  50,000  to 
F    100,000  =  1;  100,000  to  150,000  =  2) 

X_  =  Number  of  railroad  crossings 
G 
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X„  ■  Whether  or  not  parking  is  allowed  on  the  right 
side  of  section  (No  Parking  =*  0;  Parking  =  1) 

XT  «=  The  fifteen  minute  volume  in  number  of  vehicles 
during  the  time  the  speed  is  desired 

X_  =  Lowest  G/C  ratio  of  all  the  signals  included  in 
the  section 

X  =  Whether  or  not  good  progression  of  signals 
exists  (No  =  0;  Yes  -  1). 

Refer  to  Table  4  for  exact  definition  of  each  variable  and 

for  the  limitations. 

Although  the  coefficient  of  multiple  determination 

2 

(R  )  was  not  as  high  as  might  be  desired  and  although 

difficulties  were  encountered  in  the  testing  of  some  of 
the  assumptions,  it  is  believed  that  the  model  as  a 
predictor  has  merit.   Estimates  of  average  running  speed 
within  a  few  miles  per  hour  of  the  true  value  will  be 
sufficient  for  most  planning  and  traffic  engineering  uses. 
The  proposed  model  has  a  standard  error  of  estimate  of 
3.417  mph.   For  ten  examples  the  model  estimated  the 
average  running  speed  within  approximately  four  miles  per 
hour  and  estimated  fifty  percent  of  the  values  within  two 
miles  per  hour. 

As  one  reads  the  list  of  variables  utilized  in  the 
model  it  becomes  noticeable  that  some  of  these  input 
variables  are  identical  to  ones  used  in  capacity 
calculations.   This  is  what  was  desired  at  the  outset  of 
the  project. 
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It  is  of  interest  to  evaluate  the  individual  effects 
on  average  running  speed  of  the  variables.   Section  length 
affects  this  speed  more  than  any  other  variable.   The 
difference  in  running  speed  between  two  sections  whose  only 
difference  is  a  half  mile  in  length  is  about  five  miles 
per  hour.   A  variable  which  substantially  affects  running 
speed  is  volume.   The  speed  is  reduced  1.3  mph  for  every 
one  hundred  vehicle  increase.   Moreover,  if  a  section 
contains  more  than  one  signalized  intersection  with  poor 
progression,  the  reduction  increases  to  3.1  mph  for  the 
same  number  of  vehicles. 

The  population  of  the  city  increases  the  speed  to  a 
greater  extent  than  anticipated.   For  every  fifty  thousand 
people  increase  the  speed  correspondingly  increases  1.3  mph, 
emphasizing  the  fact  that  people  drive  faster  on  this  type 
of  facility  in  larger  cities. 

An  important  fact  contained  in  the  model  is  that  small 
differences  in  the  lowest  G/C  ratio  produce  substantial 
differences  in  the  speed.   As  an  example,  a  section  whose 
lowest  G/C  ratio  is  0.50  would  have  a  predicted  speed 
seven-tenths  of  a  mile  per  hour  slower  than  a  section 
whose  lowest  G/C  ratio  is  0.60.   The  approach  width  of  the 
section  also  substantially  affects  the  average  running  speed. 
The  difference  in  speed  between  a  section  having  a  twelve 
foot  approach  width  and  a  second  section  having  a  twenty- 
four  foot  approach  width  is  nine-tenths  of  a  mile  per  hour. 
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The  addition  of  a  traffic  signal  within  a  section 
decreases  speed  by  about  one-half  mile  per  hour  over  that 
existing  before  installation  of  the  traffic  signal.   A 
further  reduction  results  if  there  are  two  or  more  signals 
in  the  section  and  the  progression  is  unsatisfactory. 
Table  11  is  a  summary  of  this  paragraph. 

To  supplement  the  usefulness  of  the  proposed 
prediction  model,  the  stop  delay  data  collected  was  also 
analyzed.   This  research  project  found  the  major  cause  of 
stop  delay  to  be  the  traffic  signal.   In  this  study  the 
second  most  important  cause  of  stop  delay  was  due  to 
railroad  crossings.   These  delays  increased  as  the  number 
per  section  increased  for  both  the  traffic  conditions  of 
off-peak  and  peak.   The  percentage  of  the  total  stop  delay 
caused  by  traffic  signals  was  eighty  to  eighty-five  percent 
while  the  percentage  caused  by  railroad  crossings  was 
twelve  to  fifteen  percent.   Stop  delay  due  to  other  causes 
was  less  than  three  percent. 

Considering  sections  which  contained  one  traffic 
signal,  the  average  stop  delay  was  9.13  seconds  during 
the  off-peak  period  and  14.62  seconds  for  the  peak  period. 
Other  sections  containing  three  traffic  signals  had 
average  stop  delays  of  14.12  seconds  and  21.56  seconds, 
respectively. 
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TABLE  11 
SUMMARY  OF  SPEED  INFLUENCING  FACTORS 


Variable  Name 
Section  Length 
Approach  Width 
Signalized  Inter. 
Unsignalized  Inter, 
Population 
Volume 


Variable  Increase 

0.1  mile 

12  feet 

1  unit 

1  unit 

50000  people 

100  vehicles 


Speed  Change 
+  1.0  mph 
+0.9  mph 

-  0.7  mph 
-0.2  mph 
+  1.3  mph 

-  3.1  mph 
(poor 
progression) 

-  1.3  mph 
(good 
progression) 


Lowest  G/C  Ratio 


10  percent 


+0.7  mph 
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Using  both  the  prediction  model  for  average  running 
speed  and  the  average  values  of  stop  delay  enables  one  to 
determine  the  location  and  extent  of  improvements  to  the 
travel  time  and  to  estimate  the  benefits  gained  from  them. 
Comparing  the  benefits  with  the  costs  could  indicate 
whether  or  not  the  proposed  improvement  was  worthwhile. 
Alone,  the  prediction  model  would  function  as  a  means  of 
determining  which  frictional  factors  are  reducing  speed 
and  to  what  degree  so  that  corrections  might  be  made. 
Another  function  of  the  model  would  be  the  determination 
of  appropriate  speed  limits  on  urban  arterial  streets. 
Hopefully,  the  knowledge  contained  in  this  report  will  be 
put  to  use  in  the  field  of  traffic  engineering. 
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APPENDIX  A 
DISCUSSION  OF  THE  FOSTER-BURR  Q-TEST 
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THE  Q-TEST  FOR  EQUALITY  OF  VARIANCES* 

by 
Louis  A.  Foster  and  Irving  W.  Burr 


The  Q-test  for  equality  of  variances  is  based  on  a 
statistic  which  is  a  monotone  function  of  the  coefficient 
of  variation  of  the  sample  variances.**  As  such  it  offers 
promise  as  a  preliminary  test  for  the  assumption  of  homo- 
geneity of  population  variances  which  is  needed  in  the 
analysis  of  variance  technique.***  Although  the  Q-test  is 
not  a  so-called  quick  test,  the  test  statistic  is 
sufficiently  simple  to  permit  calculation  of  its  value  on 
a  desk  calculator  (in  contrast  to  the  logarithmic  trans- 
formation required  in  Bartlett's  test).   A  sample  variance 
taking  the  value  zero  does  not  disrupt  this  test  (as  it 
does  to  Bartlett's  and  Hartley's  tests).   Finally,  a  table 
of  critical  values  permits  the  use  of  this  test  for  small 
sample  sizes  where  the  use  of  an  asymptotic  distribution 
would  not  be  appropriate. 

THE  Q-STATISTIC:   For  equal  sample  sizes,  n,  from  each  of 
p  parent  populations,  let  s?  (for  j  =  l,...,p)  denote  the 
3th  sample  variance.   Denoting  the  value  of  the  test 
statistic  Q  by  q,  we  have: 

/  4  4W/  2  2.2 

q  ■  (s1  +  ...  +  s  )/(s1  +  ...  +  s  )  . 

For  unequal  sample  sizes  we  specify  that  each  sample 
variance,  s2,  be  calculated  by  dividing  by  the  degrees  of 
freedom,  Vj,  rather  than  by  the  sample  size,  nj  (where 


*This  brief  summary  of  the  Q-test  is  based  on  a  Ph.D. 
thesis  submitted  to  Pursue  University  by  Louis  A.  Foster 
working  under  the  supervision  of  Irving  W.  Burr. 

♦♦Equivalent  tests  have  been  proposed  by  Brandt  and 

Stevens,  but  in  a  form  depending  solely  on  an  asymptotic 
distribution  for  the  test  statistics. 

***This  statement  is  based  on  the  role  of  the  coefficient 
of  variation  of  the  population  variances  in  measuring 
the  disruption  of  the  P-ratio  in  the  analysis  of 
variance  technique,  as  set  forth  by  Box  (6)  Vol.  25, 
pages  290  and  484. 
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Vj  =  nj 


-  1,  for  j  =  l,...,p).   Let  v  denote  the  arithmetic 
average  of  the  degrees  of  freedom.   In  this  case  we  have: 

4 


q  =  v 


(vlSf  ' 


+  v  s4)/(v 

p  p 


Is? + 


+  v  s2)2 

p  p 


THE  Q-TEST:   Large  values  of  Q  lead  to  rejection  of  the 
hypothesis  of  equal  population  variances.   The  critical 
values  are  given  in  the  attached  table  for  various  numbers 
of  parent  populations,  p,  and  various  possibilities  for 
equal  degrees  of  freedom,  v  (where  v  =  n  -  1) ,  from  one  to 
ten.   This  table  can  be  used  directly  for  equal  degrees 
of  freedom.   For  unequal_degrees  of  freedom,  q  is  calculated 
as  indicated  above,  but  v  is  to  be  substituted  for  v  in  the 
attached  table ,  provided  that  v  and  the  harmonic  mean  of 
the  v's  do  not  differ  greatly.   For  large  sample  sizes 
(v  >  10),  we  note  that  p  v  (pQ-l)/2  is  asymptotically  chi- 
square  with  (p-1)  degrees  of  freedom. 
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Calculations  for  the  Kolmogorov-Smirnov  Test 
for  the  Normality  of  the  Section  Speeds 

Critical  D  for  1520  samples  (Level  of  Significance  =  0.01) 


D  =  1.031     (12) 


D  =  1.031      =   0.0264 
(1520)1/2 

Maximum  Deviation  =  0.0487 

0.0487  >  0.0264 

Therefore,  reject  hypothesis  that  the  curve  is  normal. 
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Claculations  for  the  Kolmogorov-Smirnov  Test 
for  the  Normality  of  the  Residuals 

Critical  D  for  1520  samples  (Level  of  Significance  =  0.05) 


D   =  0.886      (12) 


D  =  0.886      =   0.0228 
(1520)1/2 

Maximum  Deviation  =  0.0096 

0.0096  <  0.0228 


Therefore,  cannot  reject  hypothesis  that  the  curve  is 
normal. 


